

FINAL REPORT 

ITOS VHRR ON-BOARD DATA COMPRESSION STUDY 


NASA Contract NAS5-21940 

COUpression^study I ?il ON-BoasD B *TS 


N75* 


. tJncXas 
G3/35 1 8Q5B J 


Systems J/xnalysis 


oy 



1856 Elba Circle 
Costa Mesa, California 92626 


Report No. 75100 
January 1975 


Prepared by: 
R.M. Gray 
L. D. Davisson 




Lee D. Davisson, PhD 
President 




ITOS VHRR ON-BOARD DATA COMPRESSION STUDY 


BY 

R. M. GRAY 
L. D. DAVISSON 

SYSTEMS ANALYSIS COMPANY 
COSTA MESA f CALIFORNIA 


January 1975 



TABLE OF CONTENTS 

Summary. . . . • . ... 

X. Data Compression. System Designs 

X. Introduction . . . 

2. Theoretical Results 

3. System Designs .... 
II. Theoretical Analysis .... 

1. Introduction .... 

2. Data Model .... 

3. Quantization and PCM 

4. Predictive Quantization 


x 



LIST OF FIGURES 


Figure 1.1.1 VHRR Signals ....... 3 

Figure 1.3.1 PCM System ........ 10 

Figure 1.3.2 DPCM System . . . . . . . 11 

Figure II. 4.1 . . . . . • ... . 20 

Figure II. 4. 2 . . . . . . ... • 24 


LIST OF TABLES 


Table 1.1.1 VHRR Timing .... . . . . 4 
Table 1.1.1 Small Capacity Standard Tape Recorder . .. . 7 
Table 1.1.2 Medium Capacity Standard Tape Recorder ■ . . . 7 
Table 1.1.3 ITOS Analog Tape Recorder ..... 7 
Table II. 3.1 Rate vs SNR for PCM . . . . . 18 


ii 



LJCT o r ' rT G ;T I c 

ITOS VHRR ON- BOARD DATA COMPRESSION STUDY 


Figure 1.1.1 VHRR Signals . Summary- • • < • 3 

Figure 1.3.1 PC!.-. Sister. . . . . . . 10 

Data compression methods for ITOS VHRR data have been studied for a tape 

recorder" record -and playback application. A playback period of 9 minuted was 
aSfe&med 1 With-a nominal 18 minute record period for a 2-to-l compression-ratio. 
Bot^ analogy and digital methods were considered with the conclusion that: digital 
methods should be used. Two system designs were prepared. One is a PCM 
system and the other is an entropy- coded predictive -quantization (sometimes 
called entropy-coded DPCM or just DPCM) system. Both systems use data man- 
agement principles to transmit only the necessary data. Both systems use a 
'’medium capacity standard tape recorder” from specifications provided by the 
technical officer. The 10 bit capacity of the recorder is the basic limitation on 
the compression ratio. Both systems achieve the minimum desired 2 to 1 com- 
pression ratio. A slower playback rate (and hence bit transmission rate) can be 
uged, with. the. DPCM system due to a higher compression factor for better. link 
performance at a given CNR in terms of bandwidth utilization and error rate. To 
acht’eVe 1 the s g a in sino r e r c ompl exf logic must be used, ’Fu rth e r g ai n s c oul d be 


achieved, byj irsing.a smaller tap.eirjec.order.ReThc^rep°rt is divided into two parts. 
T^^_^irst part r summari zes .the theoretical conclusions of the second part and pre- 
sents the system diagrams. The second part is a detailed analysis based upon an 
enfp r ri cally d e ri Ve d random process model arrived at from specifications 1 ^ and 


measured data provided by the technical officer. 
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I Data Compression System Designs 

1. Introduction 

The presently existing ITOS very high resolution radiometer (VHRR) 
data utilization capability is highly restricted due to the maximum nine minute 
on-board analog tape storage capability. Much of the VHRR data is highly re- 
dundant, consisting of very flat or periodic segments which are amenable to 
sophisticated data compression processing techniques so that more than nine 
minutes of real time data can be stored at a reduced record speed while 
retaining a nine minute playback capability. A sample of the data output appears 
in figure*I. 1. 1. The data timing information appears in table *1.1.1. Figure 
1.1.1 and table 1.1.1 together with digital tape samples provide the basis for the 
theoretical data model used in the data compression analyses and designs. 

Data compression can be achieved through analog or digital methods. 

Analog methods were considered and discarded early in the study due to the 
overall trend toward digital methods in data communications and the attainability 
of practical data compression with digital methods. 

Two digital tape recorders were used nominally in the study, the specifications 
for which were provided by the contract technical officer. These appear in 
tables 1.1. 2 and 1. 1. 3 and are labelled 11 small capacity 1 ’" and medium capacity” 
respectively and will be referred to as such subsequently. For the purposes 
of comparison, table 1.1.4 presents the comparable specifications for the 
present ITOS analog tape recorder as provided by the contract technical officer . ** 
It is seen that a size advantage is gained by using either digital recorder and 
a power and weight advantage is gained by the small capacity recorder. At the 
18/9 minute nominal record/playback times the medium capacity and analog 
recorders appear to be about equal in power requirements. It will be shown 
that the small capacity recorder can not be used in this application due to the 
10^ bit capacity limitation but that the medium capacity one can be used with a 
large safety margin at the 2 to 1 compression ratio level. 


* Reproduced from "Modified Version of the Improved TIROS Operational Satellite," 
by A. Schwalb NOAA TM NESS 35, dated April, 1972. 

**Report of the Tape Recorder Action Plan Committee," NASA SP-307, dated 
March 21, 1972. 
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FIGURE LI. I . VHRR SignaLs 
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Function 

After Start 
Counts* 

of 1R Precursor 
Time (msec) 

■ Duration 

Counts Time (msec) 

Signal Characteristics 

Precursor 1R 

0 

' 0 

5.2 

1.71 

18.75 kHx squara wave. 0 to 190X 
amplitude aterta 72 + 1° before 
nedlr “ 

• 

•Pre-earth Synchroni- 
sation signal 




> 


Front Porch 

521 

' 1.71 

256 

0.65 . 

Amplitude 0Z 

Sync Pulse 

7 68 : 

■■ 2 ‘ 56 ' 

’ * 'ft 

512 

1.71 

^ i 

Amplitude 100Z, Starts 65.8 +1° 
before the nadir 

Beck Porch 

1280 v 

4.27 V. 

» 256 

0.85 

Amplitude 0Z 

Spec# seen 

1536 ^ 

* 5.12 

Not clock 
controlled 

* 2.08+ 

Amplitude 100Z period varies with 
spacecraft altitude (* attitude 

Earth scan 

. } / 

7.20t • 

Mot clock 
controlled 

45.32f 

Amplitude varies over range 0Z to 
100Z (07. Hot, 100Z Cold). Period, 
varies from 43.68 msec @900 n.ml. 
to 48.64 msec @600 n.ml. 

1st Sub-Sync marker 

4864 

16.21 '. > • 

16 

0.05 

Two level elgnal starting at 01 
amplitude for \ the period when ‘ 

. switching to 100Z for the 
■ remainder of the period 

2nd Sub-Sync marker 

13056 

43.51 

■ 16 . 

0.05 

Same as 1st Sub-Sync marker 

Space scan (post earth) 

Mot clock 
controlled 

52.52 

SB 

4.64t 

Amplitude Is undefined as signal 
may be influenced by the visible 
channel calibration target 

Voltage Calibration 

17152 

57.17 '/ 

'* 1792 

3.97 

Seven levels increasing in ampli- 
tude from 0 to 100Z. Zero level 
following 7th step. 

Visible calibration 

18944 

63.14 

*4 

256 

0.85 

Normal range when target Is Illu- 
minated by sun is sbout 20% amplitude 

Radiance calibration 
300K target 

19200 

63.99 

256 

0.85 

Amplitude about 20X. Varlss with 
target's actual temperature 


TABLE 1.1.1. 


VHRR TIMING 
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Function 

After Start of IX precursor 
Counts* Tlaie (msec) 

Duration 

Count* Tim* (msec) 

Signal Characteristics 

Temperature Senior 
30QK target (expended 
range +20 to +34°C) 

19456 

64.85 

256 

0.85 

For normal temperature range ampli- 
tude will be between 0 and 80X. 

(If the temperature Is out of this 
range, the signal will either be 
0 or between 857. and 100Z) 

Temperature Sensor 
(expanded) range +7 
to +21 C) 

19712 

65.70 

256 

0.85 

Same as above 

Temperature Sensor (ex* 
pended range -6 to +8°t) 

19968 

66.55 ■ 

; ; 236 / ; 

0.65 

Same as above 

Post-earth Synchronisation 



' t A 


Front Porch 

20224 

67.41 

1024 , 

3.41 

Amplitude OX 

Sync Pulse 

21248 

; 70.82 

256 

0.85 

Amplitude 100% ♦ 

Back Porch 

21504 

71.67 

./ V 256. 

0.85 

Amplitude OX , , . ' 

End of IR scan 

21760 

72.53 




Precursor Vis 

225231 

75.00+1.00 

’ 312 

1.71 

I 

12 • 50kHa square wave. 6 to 100X 
amplitude. Starts 72+10 before nadir 

Pre-earth Synchronisation 
Signal 

# 

• * 

i 

S. 

• 

Front Porch 

23035 

76.78 

236 

0.85 

Same as IR signal 

Sync Pulse 

23291 • 

77.63 

312 

1.71 

Same aa 1& signal - 

Back Porch 

“ 23803 

79.34 

256 

0.85 

Same aa IR signal 

Bpaca scan (pra-aarth) 

24059 

80.19 

' Mot clock 2 ,08t 

, controlled . 

Amplitude OX. Period varloe 
identically with IR signal 




TABLE 1. 1. 1 (cont) 


in 



o O 

^ a 

g s 

Ss.l 

C-i 

feb 
C ■« 


Function 

After Start of 
Count#* 

IR Precursor 
Time (msec) 

Duration 

Counts Time (msec) 

Signal Character la tica 

Earth scan 

Mot clock 
controlled 

82.27+ 

. Mot clock 
controlled 

45.32 + 

Amplitude varies over range of OX to 
100X. (OX Black, 100X White). Period 
varies Identically with IR channel. 

lat Sub-Sync marker 

27387 

91.28 

16 

0.05 

Same as IR signal 

2nd Sub-Sync marker 

35579 

118.58 

16 

0.05 

Same as IR signal 

Space scan (post earth) 

Not clock 
controlled 

127.59 

Mot clock 
controlled 

4.64+ 

Same as IR signal 

Voltage Calibration 

39675 

132.24 

1792 

5.97 

Same as IR elgnal 

Visible callb. target 

41467 

138.22 

256 

0.85 , 

Same aa IR elgnal 

Radiance calibration 
300°K target 

41723 

139.00 

256 

0.85 

Same aa IR elgnal ^ , 

Temperature sensor 
(+20 to 34 °C) 

41979 

139.93 ’ 

A',‘ 256 ’ ■ 

0.85 ? 

Same aa IR elgnal 

Temperature sensor 
(+7 to +2l°C) 

42235 

140.78 ' ;j * 

, 256 

0.85 ' . 

Same aa IR signal • ^ 

Temperature sensor 
(-6 to +8°C) 

42491 

141.64 

256 

0.85 

Same aa IR signal 

Fost-earth Synchronisation Signal . 



( 


Front Porch 

42747 

142.49 

1024 

3,41 

Sama aa IR signal 

Sync Pulse 

43771 

145.90 

256 

0.85 

Sana as IR signal 

Beck Porch 

44027 

146.76 

256 

0.85 

Same as IR signal 

Multiple* tolerance 
safety cone 

44283 

147.61 

a» * 

~2.39 


End of Scan - 


150.00 

• 




* A count is one cycle of 300-kHs ■ quart wave. \ 

t Spacecraft at 790 n.ml., 0° roll error, and nominal position for start of precursor 

t Nominal value for awitchovar from IR to Visible Scan. All following times assume that this switchover is nominal. 




TABLE 1. 1. L . (cont) 



Table 1. 1. 2 


Small Capacity Standard Tape Recorder 

g 

Storage Capacity: 10 bits 

Record /playback time: 2. 62 minutes to 28 hours 

Record/ Playback rate: ^ 6.4x10 bps 5 

9 Minute playback rate: ^ 10 bits/540 sec = 1. 9 x 10 bps 
3 

Size: 156 in 
Weight: 10 lbs. 

Power: ^ lOw. 

Cost: Unknown 

Table I. 1. 3 

Medium Capacity Standard Tape Recorder 
9 

Storage Capacity: 10 bits 

Record/playback time: 31/8 minutes to 26 2/3 hours 

Record/ Playback rate: £ 5. 12 x 10 bps £ 

9 minutes playback rate: ^ 10 bits/540 sec. = 1 , 9 x 10 bps 

Size: 468 in 
Weight: 28 lbs. 

Power: 10 to 40 watts 
Cost: Unkown 

Table 1.1.4 

ITOS Analog Tape Recorder 
3 

Size: 1200 in 
Weight: 18. 7 lbs. 

Power: 19. 8 w. (record), 16. 1 w. (Playback) 

Cost 200K 
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2. Theoretical Results 
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The detailed theoretical analysis appears in section II of this report. For 
analytical purposes the signal was assumed to be a uniform random variable on 
the [-6,0] volts range shown in figure 1*1.1 and to be sampled at 10^ samples/ sec 
(1.43 times the Nyquist rate for a 35 khz nominal data cutoff frequency. Using 
these assumptions and a uniform quantizer, the signal -to-noise ratio was 
calculated as a function of the bits of quantization (table II. 3.1). 


The number of bits of quantization times the sampling rate defines the re- 
quired recording rate for a pulse code modulation (PCM) system which stores 
and transmits the bit stream without encoding. For a 2-to-l compression ratio, 
the playback rate is then twice that number. The required total recorder capacity 
in bits is 540 (9 min x 60 sec/min) times this rate. From tables 1.1.3 and II. 3. 1 
it is seen that the medium capacity recorder can be used at quantization levels up 
to around 8 bits (48 db SNR) for a 2 to 1 compression ratio (18 minutes of data). 
Higher compression can be attained at lower quantization levels. 


Further compression can be attained by data processing operations. The 
data are fairly inactive except during the earth scan. By transmitting only the 
earth scan and single values for the calibration- temperature data instead of 
long steps, approximately 40% of the digitized values are removed- a 40% re- 
duction in average bit rate. Thus at 8 bits quantization, from table 1.1.3, 10^ 
bits/(8 bits x 60% x 10 5 samples/sec) =2080 sec. or 35 minutes of data can be 
stored on the medium capacity recorders The small tape recorder, on the other 
hand, c an hold only 3. 5 minutes at 8 bits quantization. Only at 1 bit quantization 
is the 2-to-l compression ratio achieved- a quantization of no practical use. 


From empirical studies and the theoretical results of section II of this report, 
in the 6-8 bit quantization range, about 3 more bits can be removed through entropy 
coded prediction- quantization (sometimes called differential pulse coded modula- 
tion (DPCM). In this scheme the difference is encoded by a variable length code 
b e t W een each successive value and its predicted value based upon past encoded 
values. Using this factor, at 8 bits quantization, 10^ bits^8-3) bits x 60% x 10^ 
samples /sec) = 3333 sec. or 55 minutes of data can be stored on the medium 
capacity recorder. The small tape recorder, on the other hand can hold only 5. 5 
minutes of data. Only at impractically low levels of quantization can the small 
capacity tape recorder be used. Hence, it is recommended that the medium ca- 
pacity tape recorder be used. It is clear that a substantial margin of storage 
could be used to store and replay more than 18 minutes of data if sufficient trans- 
mission channel capacity is available on playback. Alternately a smaller recorder, 
between those of figure 1. 1. 2 and 1. 1. 3 could be designed. It is clear in either case 
that the DPCM system requires a lower transmi sssion rate and/or total capacity for 
a given compression ratio. 



' 3. System Designs 
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Based upon the considerations given in the last section, two systems are 
recommended - a PCM system and a DPCM system. The PCM system is a subsystem 
to the DPCM system. Block diagrams of the two systems appear in figures 
I. 3.1 and I. 3.2. Both systems use the medium capacity tape recorder for a 
weight penalty of 9* 3 lbs. with a size gain of 732 in 3 over the ITOS analog re- 
corder (see tables 1. 1, 2 and 1,1. 3), Power requirements seem roughly equivalent. 

The earth scan data are digitized by the A/D under control of the VHRR 
clock. The digitizing starts after a fixed delay from the start of scan so that only 
the earth data are included. The controller controls the insertion of the 
digitized values into the memory buffer and the output to the tape recorder. As 
the input rate is not constant, the controller also insures a constant output rate to 
the tape recorder and time multiplexes in the telemetry and calibration values. 

On command the recorder plays back the stored values at an increased rate over 
the record speed, the speed depending on the desired compression ratio. 
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Figure I, 3.1 PCM System 
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Figure I. 3. 2 DPCM System 
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II. Theoretical Analysis 
1. Introduction 


This section of the report presents the theoretical considerations involved 
in the choice of a data compression system. A theoretical data model is de- 
veloped based upon empirical measurements and specifications from VHRR 
data provided by the GSFC technical officer. The data model is then used to 
analyze the performance of data compression algorithms in terms of required 
data rate and distortion, measured by mean squared error ( signal- to-noi se ratio). 


2, Data Model 

The source data to be compressed is assumed to be a ’’composite" or "switched" 
source consisting of four subsources. To characterize the composite source 
we describe each sub source and the amount of time spent in each. 

Source 1 : VHRR data with the satellite facing earth 
Source 2: VHRR data with the satellite facing space 
Source 3: Precursors 

Source 4: Synchronization and Calibration data 

Source 1 : The earth VHRR data (the main data of interest) is modelled as a 
random process. In order to well approximate the data and yield a tractable model 
the random process model is assumed to satisfy the following;- 

i) The process is stationary and ergodic so that calculated ensemble averages 
will agree well with measured sample averages. 

ii) The alphabet (range) of the data is the interval [-6, 0] volts as in figure 1.1.1. 

iii) The marginal probability density is uniform on an interval of width 6 volts 

iv) The data is highly correlated over short times. In particular, at the 
sampling frequency chosen to safely surpass the Nyquist rate the correlation 
coefficient is on the order of . 85 - an empirically determined quantity. 

v) The power spectral density falls off fairly rapidly. In particular, the 
3db down frequency is a few khz and the power spectral density is 
ne gligikle at 35kHz, say 20dB down from its maximum value. 

An added assumption that is mathematically desirable and well matches the 
data is the following: 

vi) The process is a Markov process. 

A final, less precise assumption that is often adopted is the following: 
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vii) The second order properties (transition probabilities, covariance, power 
spectral density) should be approximately Gaussian. In particular, the covariance 
function should be well approximated by an exponential, the form of that of a 
Gauss -Markov source. 

A model meeting all of these requirements is a Wiener process between re- 
flecting barriers at -Xand 0. In other words, if y(t) is a Wiener process, than the 
reflected process x(t) is defined as 

x(t) = - | y (t) -2kX | 

where k is such that 

- |y(t)-2k\ | e[-X, 0] 

2 

If the Wiener process has parameter or incremental variance <J , then the trans- 
ition probability density function is given by the "folded” Gaussian density 

f x(t+T)|x(t) (y,X) = 

-(y-x+2kX)^ -(y+x+2kX)^ 

A’ ~z\t\oA ' 2 1 t | a ^ 

2 + 9 

k=-°° 2 ira I t | 2n a, | t | v 

x,y e[-X, 0] (2.1) 

The random process [x(t); t €(- co , ro )} specified by 

f x(t ) x(t ) (x l x n> 

1 n 

=iti £ x(t i+1 Mt.) (x i + i ,x i )f x(t 1 ) (x x ) 

where 



) U(t.) 



is given by 2.1 and 
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f x( tl ) (X l> = X_1 ’ xe[ - X ’ 0] 

is easily demonstrated to specify a stationary ergodic Markov process with 
uniform marginals. Thus the above model satisfies requirements (i), (ii), 

(iii), and (vi). To satisfy (iv) we must choose O' to yield the desired covariance 
coefficient. Since we demand that frequencies up to 35kHz be reconstructed 
and do not care about power at higher frequencies, the Nyquist rate is 70kHz, 

To be both conservative and convenient we choose a sampling frequency of 
100kHz=10^ samples per second. We require that 


r 


A -5 
= P x (10 ) = 


K (10" 5 ) 
K (0) 

X 


= .85 


where 

K (t) =E{(x(t) - x(t))(x{t+T)-x(t+T)} 
x 

is the covariance function, it can be shown, from ( 2 . 1 ) that 


P X (T) 


K <T) 

-K x (°) 



2 T? 71-2 I T l ^ 2n+1 ) 2 

e 

( 2 n+l ) 4 


• ( 2 . 2 ) 


For cr/X= 55.4, r =.85, the desired covariance coefficient. To consider power 
spectral density we take the Fourier transform of the covariance function rather 

than the correlation function to avoid problems with the nonzero mean. It 
follows from ( 2 . 2 ) that the normalized spectral density is 

/ 


V f) 

S x<°> 



( 2 n+l) 


-2 


< 2 n+l) 4 +(^f ) 2 


2 

a tt 


<2. 3) 


At f =2400 Hz, S x (f) | S x (0) = 1/2 and hence the 3db frequency, £ 3,33 - 2400 Hz. 
Furthermore, S x (35xl0^) | S x {0) =.0047 so that the power spectral density is 
down 23dB at 35kHz and hence condition (v) is met. 

Turning to assumption (vii) we note that the covariance coefficient of (2. 2) 
is well approximated by its first term: 



f \ ~ 96 

"X (T) = ~T xp 
TT 
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(2.4) 


-5 


Even for T = 10 the 
Thus the covariance 


The approximation is quite good for moderate to large t. 
first term approximation yields .847 while P x (10 _ ^)= .85. 
of x(t) is well approximated by the exponential form of a Gauss Markov covariance 
function. Next consider the transition density of (2.1)., Approximating this sum 
by only one term would not provide a good approximation because the first term 
would approximate a Wiener process with resulting covariance coefficient r =1 
when in fact r =. 95 by calculation. A better approximation is obtained by 
approximating f .1 by a Gaussian conditional density with the same second 
moments, that i*, X 


f x(t+T)| X (t) <y|x) “ 


expC 2 (l-rW 2 C(y-*)- r (x-x)] 2 } (2.5) 


2tt(1- r )a 


where x = Ex(t) = - \fZ, 

a x 2 = E[(x(t) - x) 2 } 

= X 2 A2 
r = . 85 

To justify this approximation we note that it yields a conditional variance of 


3 ^ - 2 2 . ~ 2 

a x(t + 4(t) =x = - x a-r )= .023X 


regardless of x. The expected value of the actual conditional variance can be 
shown* to be 


E{a xft + T)|x(t) } = a x 2 ( 1 - P x (2T)) 


( 2 . 6 ) 


Evaluating (2. 6) using (2. 2): 


p (2 x l(f 5 ) = .729 


and hence 
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E{<j 2 x(t + T)|x(t ) 3 = - 023X2 
2 

Thus for the true model, <J . )Lx(t)* S a ran ^ om variable (dependent on the value 
of x(t)) with expected value , UZ3A ; For the Gaussian approximation the conditional 
variance | x (t)^ oes no * ^ e P en< ^ on x (^) and has same value. 

2 2 

The fact that E{tf . } = . 023A indicates that the variance is usually small 

in comparison with X and hence the transition density should be fairly symmetric 
about its mean and fairly bell shaped for most x except those near the edges. Thus 
the approximation of (2. 5) should be quite accurate except when x = X or 0, a 
relatively rare occurence. 

The fraction of time spent in this sub source is roughly 2x(45. 32) /150 = 60%. 

Source 2: When looking at space the data is relatively constant at some unknown 
value. In particular, the large jumps visible during the earth scan do not appear. 
Thus this source can be modeled as a very slowly varying stationary drift process 
a(t) (roughly constant over at least 10ms intervals) with a small added random 
component 6(t). Let the subsource w(t) = a(t) + 6(t). From the observed data, 

5(t) never changes by as much as 5v/m. s. , that is. 


Pr[(w(t+lm. s. )- w(t) I s 5] = Pr[ | 5(t+lm. s. )-6(t) 5] =* 0 

and hence since there are 100 samples per m. s. 


Pr[|6(t+T)-8(t) |^.05]= 0 (2.7) 

Because of this slow variation w(t) is relatively easy to compress and (2.1) is a 
sufficient characterization of the process. The subsource w(t) is in effect a fraction 
[2.08 + 4. 64]A50 =4% of the time. 

Source 3: The precursors z(t) can be modeled as stationary square waves with 
frequency 1 ess than 18, 75 kHz. The only required characteristics are that 
the minimum time between jumps: (18. 75 x lO^)-! =5. 3 x 10 ' ^ ^ 5 T, where t 
is the sampling period, and that the crossing time is a uniform random variable 
(from the point of view of the compression system). The latter fact makes the 
square waves stationary. The fraction of time for this subsource is 2(1. 71)/150= 2%. 

Source 4: This sub source includes the pre and post earth synchronization, the 
voltage, visible, and radiance calibration, and the temperature sensor data. We 
do not include the sub- synch markers of the earth scan in this source, but instead 
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consider them as simply rare events in the earth scan (the data compression 
device will locate and compress them as such). This source consists of 
stationary step functions with jumps no closer than . 85 m. s. = 85 T. The jumps 
when they occur may be any value between 6], We assume a uniform dis- 

tribution for jumps. This source is in effect a fraction 


(1-.6-.04-.02) = 34% 

of the time. 

To summarize, the data model, say v(t), appears as below: 

x(t) ^ 

Vq-j. , lt) 

y (t) > W ' 

z(t) — ^ 

Switch position a deterministic time function. 

We assume the four sub sources are independent of each other and that the 
"starting time" of the switch sequence is uniformly distributed over TO* 150m. s. ] 
so that v(t) is a stationary composite source. Furthermore, the marginal 
density for v{t) is approximately uniform and hence Ev(t) =v - -3, =3, 

3. Quantization and PCM 

In this section we study the rate /distortion tradeoff from simple A/D conversion 
using binary pulse code modulation. Let R be the quantizer rate in bits per 
symbol, that is, each^of the 10$ samples per second is uniformly quantized using 
2^* levels. Since v n = v(nt) is uniform, uniform quantization is optimal in the 
sense of minimizing distortion. For a range X=6, rate R, and uniformly distributed 
Vn the quantization error is well known to be 


_p2 

E[ (x(nt)-q(x(nt))) 2 } = = 3 * 2 2R (3.1) 

Define the signal- to- quantization noise ratios as 

2,2 2R 
SNR = a / e = 2 
v q 

where the signal variance rather than power is considered to eliminate the nonzero 
In decibels this is 


mean. 
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’ SNR = 10 log^ (cj 2 /e 2 ) = 20R log 1Q 2 ^ 6Rdb j (3.3) 

sjs g 

The actual data transmission rate is R symbols per second = R/T = RxlO 
bits per second (6ps). Table II. 3.1 .summarizes the rate /distortion tradeoff for 
simple PCM. 

Table II. 3.1. Rate vs SNR for PCM 
R R* e 2 SNR(db) 

q 


0 

0 

3 

0 

1 

1x10 

3/4= . 75 

6 

2 

2xl0 5 

3/16 =. 988 

12 

3 

3xl0 5 

3/64 = .047 

18 

4 

4xl0 5 

3/256 = . 012 

24 

5 

5x10^ 

3 A0 24 = 2. 92xl0' 3 

30 

6 

6x10^ 

3/4096 =7. 32xl0“ 4 

36 

7 

7x10 5 

3A6384 = 1. 83xl0~ 4 

42 

8 

/ 8x10 5 

3/65536 = 4. 58xl0" 5 

48 

R 

=bits/ sample 




R# = bits / sec 

€q 2 =avg. quantization noise power 


Suppose the design requirement for compressed data is to be equivalent in 
quality to 6 bit PCM or SNR = 36 db.. Thus in any compression system producing 
an approximation v n ^ to v n we require that 

E t(v -V *) 2 } S 7.32X10' 4 
n n 

order is to playback data at tw^ce the recording rate, the required 

bit rate of a tape recorder is thus 2 RxlO bps. Thus for 36 db a 1,2 x 10^ bps 
tape recorder is required if standard PCM is used. This is well within the range 
of the medium standard satellite tape recorder with 1. 9 xlO^ bps but is much greater 
than the rate of tjhe lighter weight small standard satellite tape recorder with 
capacity 1.9x10 bps. If the small tape recorder is to be used, therefore, approxi- 
mately 6:1 further compression would be required. 
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As a final note, nothing can be gained by noiseless coding the PCM data 
using a Shannon- Fan o or Huffman code since the entropy of the uniformly 
quantized data H(q(v n )) = R since the v n are uniformly distributed. Further 
gains(ft?40%) could be achieved, however, by separating the earth scan data 
from the other, slowly varying data. 

4. Predictive Quantization 

In a predictive quantization scheme we transmit quantized error samples 
between the actual signal value and a prediction of the signal value. It is 
easily shown that one step linear prediction is nearly optimal for 

the x(t) sub source and hence we consider only simple linear prediction. If 
slight improvement is desirable, it is not clear whether two step linear 
prediction or the first two terms of the optimal nonlinear estimate 
is superior^. ' 


We shall design the compression scheme primarily for the most random 
subsource x(t), but we shall see that it works quite well on the composite 
source v{t). 


Gi ven th e discrete -time process x n = x(nt) with r = p x (t) = E{(x n ~x n ) 

(x -x ,)} /v v , the optimal one- step linear predictor is given by 
n-1 n-1 


x (x ) = r x + x (1-r) = rx - — (1-r) 
n n-1 n-1 n . n-1 2 


Define the error sequence e n = x n -x n (x n _ i) 
x n - rxjj.j + \/2 (1-r) and note that e n = 0 anda% = a e =Ox(l-^)» 

, We have by rearranging terms that 


.n 




v 


n-k X- 


k=-« 


(4.1) 


so that the error sequence determines the data sequence. We note that (4.1) is 
essentially an asymptotic or steady- state equation assuming the prediction 
to be done starting with the remote past without periodic initialization as is 
frequently used in practice. 

The sequence £e n 3 has two advantages over the original data sequence from 
the standpoint of data compression. First, the marginal density for e n is 
more M bunched" since = a x ^(l-r^) <a x ^. This allows for further compression 
using either noiseless coding of the quantized error to take advantage of the 
reduced entropy or by non-uniform quantizing matched to the error sequence. 
Secondly, the error sequence is less correlated than the data sequence so that 
single symobl compression is nearly optimal. The decreased correlation 
follows, for example, since 
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|E(e k e k+1 ) | = !(l+r 2 )K x (T)-r[K x (0) + K^t)]) 

I 

1 2 r 3 _ -j ~ 2 3 3 

= a [r -rp (2 t.j = cr ( r - r ) = 0 . 

X 'X X 

We note that if {x^ was actually Gauss -Markov with parameter r, then 
Px(2t) h and the above expression is exactly zero. 

The proceeding discussion motivates the following system 



Decoder 


Figure II. 4. 1 
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is, -have simply replaced the .errors by the -quantized -error sequence 
q((^), IThe prediction is based -on the quantized .error s 


x = sTxx ■+ 
m m-l 



* 

2 


: ]C ^ (E k )l ‘ n ' k - 2 

00 


« 4 . 2 ) 


3so tth&t 


-»* 

3 C 

3 n 


sc -+ <g(£ ) 
m xn 


i$ tfhs approximation process. 'IChse taverage -distortion is 


'O 
£ 4*1 

fx -at ) } = 


)\Tl 

L m xn m ^ m 


= Hffier- «icr#} 

HI :n 


»inee q(e n ), TDhat is, tthe data distortion -.equals tide quantization 

distortion iin -the -error sequence . it xis cditen a. c s sunxed ithat for sufficiently fine 
quan t x za t ion , 'ehis. probabalisticaily 'Similar Ito "the sequence -£n** fChe complicated 
•- feedback relation makes -exact-analysis difficult , hut \we can obtain ;a .good 
approximation found ^as follows^ We have 


m-%) 


e — x -(rx - “ (X-r)) 
:n :n n-L 2 


!e = x ^ x = x - (rx — *r (1-r) ) 
:n :n :n :n n-.l -2 


of PAGE 18 

0P POOH QUALIIY 


=3X-ifcGx ,'+cq{e --).] -^r(X-:r)} 
xn :n-d . :n-=l -2 


< , 
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so that 


e - e 
n n 


rx -r [x _+ q (e )] =r(e -q(e ,)] 
n-X n-1 n-1 n-1 n-1 


Since the processes are stationary we have that 

E{(e -e ) Z 3= r Z E{(e -q(e )) Z ] (4.4) 

n n n n 


so that as often conjectured, = e n in a squared- error sense if the quantization 
error E{ (en-q(^n)) j i- s small. Using the fact that the densities are nearly ^ 
symmetric about their mean so that with symmetric quantization E(q(e)) = 0 =E(e) 
we have from above that 

a~ 2 = E[e’ 2 } =a 2 + r 2 E[(T -qfe )) 2 } + 2rE(e I'e -q{e )]} (4.5) 

€ n e n n n n-1 n-l 

v_ Intuitively, since s' _ and q(T ' ) 

n-1 n-1 

depend only on £n-k ^ or k ^1 and are therefore nearly uncorrelated, the cross 
term above should be small so that 

2 i "w 2 2 r ~ rw 2k . 

.= a +r E{(g -q(e ) )j (4.6) 

e e n ^ n - 


The variance can be bounded above via the Cauchy inequality to obtain 


cn. Z *(cj + r[E{(e -q(0) Z } ] 1/Z ) Z (4.7) 

w b n n 


but this bound seems overly conservative and we instead use (4, 6). 

Since e n = (x n - X/2) - r(x n „j - A/2), the Gaussian approximation of the transition 
density of (2. 5) implies that 


f e |x (5 |x) = f , (?- V2 + r(x-\/2)|x) 

n n-l x x 

n n-1 

^Ziro (1 -t) \}° (1-r ) J 


(4. 8) 
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If the quantization E[(e -q(e)) } is small enough, then ^ shogld also be 
approximately Gaussian, but with possibly larger variance 0" e . This approximation, 
however, is not required in some of the analyses discussed and should be used 
with caution. 

We first consider obtaining compression using uniform quantization followed 
by noiseless coding. We begin by noting that the errors are in general small 
so that with high probability \%x\ Firstly, in all the observed earth scan 

data the jumps never exceed 3 volts in one sample. ^Secondly, if is approximately 
Gaussian with variance a%* , then pr(|e n | ^ X/2 ) ^ 2Q {X/Zo^, = 2Q (3. 3) « 0. 

a negligible amount. Thus if we uniformly quantize e n inthe range [-X/2,X/2jat 
rate R* we obtain via standard approximation that- 


EC's -qfe )) 2 } ~}?Z~ 2R (4.9) 

n n 12 

so that 


E{ (x -x *) 2 }~ A 2 2 -2R' (4.10) 

n n — z 

12 

if is reconstructed from the q(e: n }. We note ^ at froin (4.6) and (4. 9) we have 
that 


2 ~ 2 
a,-* - a + 

e e 


2 y 
12 


-2R 1 


so that if R 1 ^ 5, the second term is 


^ <v o 

negligible in comparison with the first so that = a G as desired 


We next attempt to achieve compression by noiseless encoding the quantized 
error sequence. Now, however, we wish to make use of the fact that it is the 
composite source v n we wish to compress rather than x n . To accomplish this 
we make a slight modification of the system. The principal difference between 
x n and v n is the presence of relatively rare but large jumps in the latter. Thus 
if we use the same prediction rule for v n as we developed for x n , we test to see 
if |*e n | >X/2. If l^nl £ V2 we proceed as above and uniformly quantize on 
[-X/2, X/2] and then noiselessly encoded the 6-bit symbol^ using a variable -length 
technique such as Huffman or Shannon-Fano coding. If |e n | > X/2, however, then 
we instead send a 6 bit quantized version of the actual data symbol 


— ...... x.. . e. [ t X, 0] 

n 


with an appropriate prefix to flag the presence of a data sample instead of an error 
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For the modified system we have as before that 


E[(v - v "l 2 } = ^ 2' 2R (4.11) 

n " 12 

We next consider the attainable bit rate in bits per symbol. Let N be a random 
variable denoting the number of bits required to encode each sample (either 
by quantizing the error or by quantizing the sample itself). The average rate 
in bits per symbol is then E[N] which by iterated expectation is 


E[N> E[Nj v -X ]Pr[v =x ] + 
n n n n 

E[n|v = w 3 Pr[v = w ] 4* E[N]v = z ] Pr[ v = z ] 
nn nn nn nn 

+ E[n|v =y ] Pr[v = y ] . 
n n n n 


Let K denote the number of prefix bits attached when v n is quantized (typically 4). 
We assume that a universal noiseless code is used, e. g. , that empirical fre- 
quencies are used for the relative probabilities* Such a code will then work 
nearly optimally regardless of the sub source* . 

If v - z n , either e n = 0 or e = 6 so that 
n ^ 


E[n|v -z ]= (6+K) Pr[e ^ X/2|v =z ] = (6+K) /5 
nn n ! nn 

so that 


E [nJ u = z ] Pr [ u = z ] 
n n n n 

= (6+K)(. 2)(. 02) 


=(6+K)(. 004) 
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To compute E[n|v = x ] we use a standard approximation which is accurate 
for moderate R r : n 


H (q(~ n ) I v n = * n ) = 


H(e_ |v_= xj - log 2 X2 


-R 


n n n 


= R 1 - log-X + H(e* | v = x ) 
°2 n n n 


where H denotes the marginal differential entropy for e (conditioned on v^=x^). 
H is bounded above (and approximated by) the differential entropy of a Gaussian 
variable with the same variance: 


H(q(e ) v .= x ) — 
n n ' n n 


i 2 

R -log_X+ 1/2 log 2rTea~ 

u b C 

f n 2 

1 , Z neCT~ 

R + 1/2 log ? — — e 

Z X 2 


= R - , 67 


(4. 12) 


with a— = . 023 X 
e 


E[nI'v — x ] Pr[u =x ]= (R - ,.674) (.6) 
n n n n 


Usin^a similar approximation as above on w’ and the fact that 
Pr[ | e n | ^ . 01 | v = wj= 0 so that 


c£. S (.05) Z 

e hr 


H(q(£ ) | v = w ) ^ 


n n n 


R +1/2 log 2 2TTea^j 2 w A 2 = R*-5 
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so that 


E[N|v = w ]Pr[v =W ] =(R-5)(. 04) 

* n n n n 

Finally, consider the sub source v n of jumps. Jumps can occur no more 
often than once every 85 samples and of those jumps no more than 1/2 exceed 
A/ 2, except for small A, Thus 


E{n|v =y } = E{n|v = y , |~ I >A/2} ; • 
1 n n - n n n 


PrC |~ n l s ^ ^ v n = y n ^ + Efc.|v n = y n » |e n l fi ^/ 2 } » 
PrCf? n | sx/2 | v n =y n } 


= (R+K) 1/2 . 1/85 + E{N| v = y , |~ | * \/2}xl/2 

n n n 

= 1/2[H(? |v =y, |e |) .. + R- log, 6] 

n n n n . c 

for moderate K 


where 


, 2TTeO‘~t 

^[R + 1/2 log ] 1/2 

X 


a~i = E{e |v =y , |e | £ A/2 
ejv n ' n 7 n 1 n' 


If 


v = v , | e | = |v - r v | = |l-r| |v _ | ^ (. 05) 
n n-1 n n n-1 ' * n-1 


and this occurs at least 84/85 of the time. 


If v n v n-l* 


then there is a jump 


. xi-j,' 

which is by assumption uniformly distributed over [-A/2, A/2] so that 
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CT ~ I ~ (. 05) 2 84/85 + 1/2 1? 1/85 = . 001 \ Z 

e v 


so that 


E{N | v = y } * 1/2 {R 1 + 1/2 log 
n ti 


2 rr e c? ~ 
e 


= 1/2 [R’ - 5. 9] = R’/2- 3 


Combining the previous relations we have for an efficient universal code 

that 


E[N]~ (6 + K) (.004) 

+ ( R f - .674) (.6) + (R« -5) (.04) 

+ (R'/2- 3) (. 34) 

The first and third terms are negligible (since so little time is spent in those 
sub sources and since they are highly redundant). The expression becomes 
therefore 


E[N] ~ ( R' - . 674) (. 6) + 
(R f /2-3) (.34) 


The second term is likely somewhat conservative since a Gaussian bound was 
used and the data is quite non- Gaussian. The first term is likely more accurate 
since when x is in effect,' is approximately Gaussian. 
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To summarize: 

i 


E { x -x *) 2 } 
n n 



R = E [N ] = (R' - . 674 ) (.6) + (~ - 3) (. 34) 

where a buffer of roughy (lbO)x(lOO) samples is required so that the output 
bit stream will have rate ~R. In particular, for R' = 6 bits and l = 6 so 
that the entire range is quantized. 


as desired, and 


E [(x -x *) 2 ] = 7.32 x 10 4 
n n 

R =* 3. 20 


The same compression can be achieved by synchronizing on the earth 
scan and transmitting only that data with the calibration and telemetry 
data multiplexed in. In this case, further compression can be achieved by 
quantizing a narrower range of values. Empirical studies indicate that the 
earth scan values are confined to approximately 25% of the total possible 
range so that R' =R-2 bits is adequate. Combined with the variable length 
coding factor, an overall reduction of approximately 3 bits is then possible 
for R= 6 to 8 bits. 

Data compression with predictive quantization can also be achieved using 
nonuniform quantization of the error sequence. This scheme has the advantage 
of being fixed rate, but the disadvantage of depending crucially on the model 
for choosing the quantization intervals. The noiseless coding, however, can adapt 
to the varying sub sources. In addition, in some calculations involving the x 
sub source alone, the nonuniform quantizer appeared comparable but slightly n in- 
ferior to the noiseless-coded scheme and hence this system was not studied 
further. 



